WORLD INTELLECHJAL PROPERTY ORGANIZATION 
Intenutkmtl Bureau 




PCX 

INTERNATIONAL APPUCATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCD 



(51) International Patent Classfflcation ^ : 

C12N 9708, 15/53, 15/63, 1/21, 15A)9, 
C12P l/PO, C12Q 1V30 



Al 



(11) Inteniatlottal Publication Number: WO 98A)0526 

(49) International Publication Date: 8 January 1998 (08.01.98) 



(21) International Application Number: PCT/US97/ 165 1 3 

(22) International FUlng Date: 3 July 1997 (03.07.97) 



(30) Priority Data: 
08/674.887 



3 July 1996 (03.07.96) 



US 



(81) Designated States: AU, CA. JP, European patent (AT. BE, 
CH, DE. DK, ES, FI, FR. GB. GR. IE, FT. LU. MC, NL, 
PT, SE). 



Published 



(71) Applicant: RECOMBINANT BIOCATALYSIS, INC 
[US/US]: 505 Coast Boulevard South, La Jolla. CA 92037 
(US). 

(72) Inventors: ROBERTSON, Dan, E.; 33 Evergreen Lane, Had- 

donfield, NJ 08033 (US). SANYAL. Indrajit; H8. Pickwick 
Apartments. Maple Shade. NJ 08052 (US). ADHIKARY. 
Robert, S.\ 1 1 Hoffman Avenue, Chcny Hill, NJ 08003 
(US). 

(74) Agent: HAILE, Lisa, A.; Fish & Richardson P.C.. Suite 1400, 
4225 Executive Square. U Jolla. CA 92037 (US). 



With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



($4)TiUe: CATALASES 
(57) Abstract 

Catalase enzymes derived from bacterial for the geoera Alcaligenes (Delaya) and MicroscUla arc disclosed. The enzymes are produced 
from native or recombinant host cells and can be utilized to destroy or detect hydrogen peroxide, e.g., in production of glyoxyllc acid and 
in glucose sensors, and in processes where hydrogen peroxide is used as a bleaching or antibacterial agent. e.g.. in contact lens cleaning, 
in bleaching steps in pulp and paper preparation and in the pasteurization of dairy products. 



FOR THE PURPOSES OP INFORMATION ONLY 



Codes used to identify States patty to the PCT on the front pages of pamphleu publishing international applications under the PCT. 



AL 


Mbtaok 


ES 


Spftin 


LS 


Leaotbo 


SI 


Slovenia 


AM 


Annenia 


n 


Fmland 


LT 




SK 


Slovakia 


AT 


Aintiu 


Fit 


Prmco 


LU 


Liucmbooif 


SN 


Senetal 


AU 


AuKiiUa 


OA 


Gdxn 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azertitiju 


GB 


Uiked KiDfibmi 


MC 


Monaco 


TD 


Chad 


BA 


Dooiit tnd Herzejovin* 


GE 


CmrfU 


MD 


Republic or Mokkwa 


TG 


Togo 


BB 




GH 


Ohaaa 


MC 


Madagucar 


TJ 


T^^daaa 


BE 


BelfttBD 


GN 


Ouinei 


MK 


Tlio fofiiUi YofOilav 


TM 


Taifrntf iiktio 


BP 




CR 






Republic of Macedoua 


TR 


TtBfcejr 


BG 


BtttgMb 


HU 


HiBtaiy 


ML 


Mali 


TT 


1>iakiad ud Tnfa^ 


Bl 




IE 




MN 


MomoUi 


UA 


Utoaiae 


BR 


Bnul 


IL 


bnel 


MR 


Mainilania 


VG 


Uganda 


BY 


BehnB 


18 




MW 


Malawi 


US 


UniHd Statea of America 


CA 


Ctudi 


IT 




MX 


Mexico 


uz 


UsbeUdaa 


CF 


CcBtnl Afrkta RcpoUk 


JP 


Japan 


NE 




VN 


Viet Nam 


CG 




KE 


Kcaya 


ML 


Ndheitaods 


YU 


YufotUvia 


CH 


Switzerlnd 


KG 


KyijTzttaQ 


NO 


Norway 


zw 


Zimhabwe 


a 


CAte d'lvoiie 


KP 




NZ 


NewZodand 






CM 






Republic ofltami 


PL 


Poland 






CN 




KR 


RqiQblk of Kixot 


PT 


Ptttupl 






CU 


Cuba 


KZ 




RO 


Romaaia 






CZ 


Czech RefwUic 


LC 


Sunt LocU 


RU 


Rstaiaa Fedenlion 






DE 


GerroiDy 


U 


LkBttfitttfin 


5D 


Sudan 






DK 
EE 


Deninvfe 
Eitcnta 


LK 
LR 


SriLMka 
Ubcfia 


St 
SG 


Slnjapon 







wo 98/00526 



PCT/US97/16513 



- 1 . 
CATALASES 

Field of the Invcntinn 

This invention relates generally to enzymes and more specifically to 
catalases and polynucleotides encoded such catalases, including methods of use. 

5 Background 

This invention relates to newly identified polynucleotides, polypeptides 
encoded by such polynucleotides, the use of such polynucleotides and polypeptides, 
as well as the production and isolation of such polynucleotides and polypeptides. 
More particularly, the polynucleotides and polypeptides of the present invention have 

1 0 been putatively identified as catalases. 

Generally, in processes where hydrogen peroxide is a by-product, 
catalases can be used to destroy or detect hydrogen peroxide, e.g., in production of 
glyoxylic acid and in glucose sensors. Also, in processes where hydrogen peroxide is 
used as a bleaching or antibacterial agent, catalases can be used to destroy residual 

1 5 hydrogen peroxide, e.g. in contact lens cleaning, in bleaching steps in pulp and paper 
preparation and in the pasteurization of dairy products. Further, such catalases can be 
used as catalysts for oxidation reactions, e.g, epoxidation and hydroxylation. 
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Summary of the Invention 

In accordance with one aspect of the present invention, there are provided 
novel enzymes, as well as active fragments, analogs and derivatives thereof. 

In accordance with another aspect of the present invention, there are 
5 provided isolated nucleic acid molecules encoding the enzymes of the present 
invention including mRNAs, cDNAs, genomic DNAs as well as active analogs and 
fragments of such enzymes. 

In accordance with yet a further aspect of the present invention, there is 
provided a process for producing such polypeptides by recombinant techniques 
1 0 comprising culturing recombinant prokaryotic and/or eukaryotic host cells, containing 
a nucleic acid sequence of the present invention, under conditions promoting 
expression of said enzymes and subsequent recovery of said enzymes. 

In accordance with yet a further aspect of the present invention, there are 
also provided nucleic acid probes comprising nucleic acid molecules of sufficient 
1 5 length to specifically hybridize to a nucleic acid sequence of the present invention. 

In accordance with yet a further aspect of the present invention, there is 
provided a process for utilizing such enzymes, or polynucleotides encoding such 
enzymes, for in vitro purposes related to scientific research, for example, to generate 
probes for identifying similar sequences which might encode similar enzymes from 
20 other organisms by using certain regions, i.e., conserved sequence regions, of the 
nucleotide sequence. 

In accordance with yet a further aspect of the present invention, there is 
provided antibodies to such catalases. These antibodies are as probes to screen 
libraries from these or other organisms for members of the libraries which could have 
25 the same catalase activity or a cross reactive activity. 

In another embodiment, the invention provides a method for catalyzing an 
oxidation reaction comprising contacting a substrate with an effective amount of an 
enyzme selected from the group consisting of an amino acid sequence set forth in 
SEQ ID NOS: 7 or 9, thereby catalyzing an oxidation reaction. Another method of 
30 the invention includes the detection and/or destruction of hydrogen peroxide in a 
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sample comprising contacting the sample with an effective amount of an enzyme 
having an amino aeid sequence set forth in SEQ ID N0:7 or SEQ ID NO:9, and 
detecting the presence of hydrogen peroxide in the sample. Hydrogen peroxide acts 
as a substrate for catalases, thus, either the detection and/or the destruction of 
hydrogen peroxide is achieved by combining a sufficient amount of the catalases of 
the invention with a sample or material suspected of containing hydrogen peroxide. 

These and other aspects of the present invention should be apparent to 
those skilled in the art from the teachings herein. 



Brief Description of the Drawings 

The following drawings are illustrative of an embodiment of the invention 
and are not meant to limit the scope of the invention as encompassed by the claims. 

Figure I shows the full-length DNA sequence and the corresponding 
deduced amino acid sequence for Alcaligenes (Deleya) aquamarinus Catalase - 
64CA2. 

Figure 2 shows the full-length DNA sequence and the corresponding 
deduced amino acid sequence for Microscilla Jurvescens Catalase 53CA 1 . 



Detailed Description of Preferred Embodiments 

In order to facilitate understanding of the following description and 
examples which follow certain frequently occurring methods and/or terms will be 
20 described. 

The term "isolated" means altered "by the hand of man" from its natural 
state; i,e., if it occurs in nature, it has been changed or removed from its original 
environment, or both. For example, a naturally occurring polynucleotide or a 
polypeptide naturally present in a living animal in its natural state is not "isolated", 
25 but the same polynucleotide or polypeptide separated from the coexisting materials of 
its natural state is "isolated", as the term is employed herein. For example, with 
respect to polynucleotides, the term isolated means that it is separated from the 
nucleic acid and cell in which it naturally occurs. 
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As part of or following isolation, such polynucleotides can be joined to 
other polynucleotides, such as DNAs, for mutagenesis, to form fusion proteins, and 
for propagation or expression in a host, for instance. The isolated polynucleotides, 
alone or joined to other polynucleotides such as vectors, can be introduced into host 
5 cells, in culture or in whole organisms. Introduced into host cells in culture or in 
whole organisms, such polynucleotides still would be isolated, as the term is used 
herein, because they would not be in their naturally occurring form or environment. 
Similarly, the polynucleotides and polypeptides may occur in a composition, such as a 
media formulation (solutions for introduction of polynucleotides or polypeptides, for 
1 0 example, into cells or compositions or solutions for chemical or enzymatic reactions 
which are not naturally occurring compositions) and, therein remain isolated 
polynucleotides or polypeptides within the meaning of that term as it is employed 
herein. 

The term "ligation" refers to the process of forming phosphodiester bonds 

1 5 between two or more polynucleotides, which most often are double stranded DNAs. 
Techniques for ligation are well known to the art and protocols for ligation are 
described in standard laboratoiy manuals and references, such as, for instance, 
Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed.; 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1 989). 

20 The term "gene" means the segment of DNA involved in 4producing a 

polypeptide chain; it includes regions preceding and following the coding region 
(leader and trailer) as well as intervening sequences (introns) between individual 
coding segments (exons). 

A coding sequence is "operably linked to" another coding sequence w^en 

25 RNA polymerase will transcribe the two coding sequences into a single mRNA, 
which is then translated into a single polypeptide having amino acids derived from 
both coding sequences. The coding sequences need not be contiguous to one another 
so long as the expressed sequences ultimately process to produce the desired protein. 

"Recombinant" enzymes refer to enzymes produced by recombinant DNA 

30 techniques; i.e., produced from cells transformed by an exogenous DNA construct 
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encoding the desired enzyme. nSynthetic" enzymes are those prepared by chemical 
synthesis. 

A DNA "coding sequence of or a "nucleotide sequence encoding" a 
particular enzyme, is a DNA sequence which is transcribed and translated into an 
5 enzyme when placed under the control of appropriate regulatory sequences. 

"Plasmids" are designated by a lower case "p" preceded and/or followed 
by capital letters and/or numbers. The starting plasmids herein are either 
commercially available, publicly available on an unrestricted basis, or can be 
constructed from available plasmids in accord with published procedures. In addition. 
10 equivalent plasmids to those described are known in the art and will be apparent to 
the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a 
restriction enzyme that acts only at certain sequences in the DNA. The various 
restriction enzymes 

1 5 used herein are commercially available and their reaction conditions, cofactors and 
other requirements were used as would be known to the ordinarily skilled artisan. For 
analytical purposes, typically 1 ^g of plasmid or DNA fragment is used with about 2 
units of enzyme in about 20 ^1 of buffer solution. For the purpose of isolating DNA 
fragments for plasmid constniction, typically 5 to 50 ug of DNA are digested with 20 
20 to 250 units of enzyme in a larger volume. Appropriate buffers and substrate amounts 
for particular restriction enzymes are specified by the manufacturer. Incubation times 
of about 1 hour at 37.C are ordinarily used, but may vary in accordance with the 
supplier's instnactions. After digestion the reaction is electrophoresed directly on a 
polyaciylamide gel to isolate the desired fragment. 
25 Size separation of the cleaved fragments is perfomied using 8 percent 

polyacrylamide gel described by Goeddel et al.. Nucleic Acids Res,, 5:4057 (1 980). 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide 
or two complementary polydeoxynucleotide strands which may be chemically 
synthesized. Such synthetic oligonucleotides have no 5' phosphate and thus will not 
30 ligate to another oligonucleotide without adding a phosphate with an ATP in the 



SUBSTITUTE SHEET (RULE 26) 



WO98A>0526 



PCT/US97/16513 



-6- 

presence of a kinase. A synthetic oligonucleotide will ligate to a fragment that has not 
been dephosphorylated. 

"Ligation" refers to the process of forming phosphodiester bonds between 
two double stranded nucleic acid fragments (Maniatis, T., et al,, Id., p. 146). Unless 
5 otherwise provided, ligation may be accomplished using known buffers and 
conditions with 10 units of T4 DNA ligase ("ligase") per 0.5 ng of approximately 
equimolar amounts of the DNA fragments to be ligated. 

Unless otherwise stated, transformation was performed as described in 
Sambrook and Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring 
1 0 Harbor Uboratory, 1 989. 

hi accordance with an aspect of the present invention, there are provided 
isolated nucleic acids (polynucleotides) which encode for the mature enzyme having 
the deduced amino acid sequence of Figure 1 (SEQ ID NO: 7). 

In accordance with another aspect of the present invention, there are 
1 5 provided isolated nucleic acids (polynucleotides) which encode for the mature 
enzyme having the deduced amino acid sequence of Figure 2 (SEQ ID NO: 9). 

hi accordance with another aspect of the present invention, there is 
provided an isolated polynucleotide encoding the enzyme of the present invention. 
The deposited material is a genomic clone comprising DNA encoding an enzyme of 
20 the present invention. As de|X)sited with the American Type Culture Collection, 
12301 Parklawn Drive, Rockvilie, Maryland 20852, USA, the deposited material is 
assigned ATCC Deposit No. 

The deposit has been made under the temis of the Budapest Treaty on the 
International Recognition of the Deposit of Micro-organisms for Purposes of Patent 
25 Procedure. The clone will be irrevocably (without restriction or condition) released to 
the public upon the issuance of a patent. This deposit is provided merely as 
convenience to those of skill in the art and is not an admission that a deposit would be 
required xmder 35 U.S.C. § 1 1 2. The sequence of the polynucleotide contained in the 
deposited material, as well as the amino acid sequence of the polypeptide encoded 
30 thereby, are controlling in the event of any conflict with any description of sequences 
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herein. A license may be required to make, use or sell the deposited material, and no 
such license is hereby granted. 

The polynucleotides of this invention were originally recovered from a 
genomic gene library derived from two sources. The first, Alcaligenes (Delaya) 
5 aquamarinus, is a P-Proteobacteria. It is a gram-negative rod that grows optimally at 
26° C and pH 7.2. The second, Microscillafurvescens, is a Cytophagales (Bacteria) 
isolated from Samoa. It is a gram-negative rod with gliding motility that grows 
optimally at 30** C and pH 7.0. 

With respect to Alcaligenes (Delaya) aquamarinus, the protein with the 
1 0 closest amino acid sequence identity of which the inventors are currently aware is the 
Microscillafurvescens catalase (59.5 % protein identity; 60 % DNA identity). The 
next closest is a Mycobacterium tuberculosis catalase (KatG), with a 54 % protein 
identity. 

With respect to Mcroscillajurvescens, the protein with the closest amino 
1 5 acid sequence identity of which the inventors are currently aware is catalase I of 
Bacillus stearothermophilas, which has a 69% amino acid identity. 

Accordingly, the polyoucleotides and enzymes encoded thereby are 
identified by the organism from which they were isolated. Such are sometimes 
referred to below as "64CA2" (Figure 1 and SEQ ID NOS: 6 and 7) and "53CAI" 
20 (Figure 2 and SEQ ID NOS : 8 and 9). 

One means for isolating the nucleic acid molecules encoding the enzymes 
of the present invention is to probe a gene library with a natural or artificially 
designed probe using art recognized procedures (see, for example: Current Protocols 
in Molecular Biology, Ausubel F.M. et al. (EDS.) Green Publishing Company Assoc. 
25 and John Wiley Interscience, New York, 1989, 1992). It is appreciated by one skilled 
in the ait that the polynucleotides of SEQ ID NOS: 6 and 8, or fragments thereof 
(comprising at least 12 contiguous nucleotides), are particularly useful probes. Other 
particularly usefril probes for this purpose are hybridizable fragments of the 
sequences of SEQ ID NOS: 6 and 8 (i.e., comprising at least 12 contiguous 
30 nucleotides). 
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With respect to nucleic acid sequences which hybridize to specific nucleic 
acid sequences disclosed herein, hybridization may be carried out under conditions of 
reduced stringency, medium stringency or even stringent conditions. As an example 
of oligonucleotide hybridization, a polymer membrane containing immobilized 
5 denatured nucleic acids is first prehybridized for 30 minutes at 45**C in a solution 
consisUng of 0.9 M NaCl, 5.0 mM NaH2P04, pH 7.0, 5.0 mM NajEDTA, 0.5% SDS, 
lOX Denhardt's, and 0.5 mg/mL polyriboadenylic acid. Approximately 2X10' cpm 
(specific activity 4-9 X 10' cpm/ug) of ^^p end-labeled oligonucleotide probe are then 
added to the solution. After 1216 hours of incubation, the membrane is washed for 30 

1 0 minutes at room temperature in 1 X SET ( 1 50 mM NaCl, 20 mM Tris hydrochloride, 
pH 7.8, 1 mM Na2EDTA) containing 0.5% SDS, followed by a 30 minute wash in 
fresh IX SET at (Tm less 10°C) for the oligonucleotide probe. The membrane is then 
exposed to auto-radiographic film for detection of hybridization signals. 

Stringent conditions means hybridization will occur only if there is at least 

1 5 90% identity, preferably at least 95 % identity and most preferably at least 97% 
identity between the sequences. Further, it is understood that a section of a 100 bps 
sequence that is 95 bps in length has 95% identity with the 1090 bps sequence from 
which it is obtained. See J, Sambrook et ai. Molecular Cloning, A Laboratory 
Manual 2d Ed„ Cold Spring Harbor Laboratory (1989) which is hereby incorporated 

20 by reference in its entirety. Also, it is understood that a fi-agment of a 1 00 bps 

sequence that is 95 bps in length has 95% identity vAxh the 100 bps sequence fixjm 
which it is obtained. 

As used herein, a first DNA (RNA) sequence is at least 70% and 
preferably at least 80% identical to another DNA (RNA) sequence if there is at least 

25 70% and preferably at least a 80% or 90% identity, respectively, between the bases of 
the first sequence and the bases of the another sequence, when properly aligned with 
each other, for example when aligned by BLASTN. 

The present invention relates to polynucleotides which differ from the 
reference polynucleotide such that the differences are silent, for example, the amino 

30 acid sequence encoded by the polynucleotides is the same. The present invention also 
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relates to nucleotide changes which result in amino acid substitutions, additions, 
deletions, fusions and truncations in the polypeptide encoded by the reference 
polynucleotide. In a preferred aspect of the invention these polypeptides retain the 
same biological action as the polypeptide encoded by the reference polynucleotide. 
5 The polynucleotides of this invention were recovered from genomic gene 

libraries from the organisms identified above. Gene libraries were generated from a 
Lambda ZAP II cloning vector (Stratagene Cloning Systems). Mass excisions were 
performed on these libraries to generate libraries in the pBluescript phagemid. 
Libraries were generated and excisions were performed according to the 
1 0 protocols/methods hereinafter described. 

The polynucleotides of the present invention may be in the form of RNA 
or DN A, which DNA includes cDNA, genomic DNA, and synthetic DN A. The DNA 
may be double stranded or single-stranded, and if single stranded may be the coding 
strand or non-coding (anti-sense) strand. The coding sequences which encodes the 
1 5 matxire enzymes may be identical to the coding sequences shown in Figures 1 -2 (SEQ 
ID NOS: 6 & 8) or may be a different coding sequence which coding sequence, as a 
result of the redundancy or degeneracy of the genetic code, encodes the same mature 
enzymes as the DNA of Figures 12 (SEQ ID NOS: 6 & 8). 

The polynucleotide which encodes for the mature enzyme of Figures 1-2 
20 (SEQ ID NOS: 7 & 9) may include, but is not limited to: only the coding sequence for 
the mature enzyme; the coding sequence for the mature enzyme and additional coding 
sequence such as a leader sequence or a proprotein sequence; the coding sequence for 
the mature enzyme (and optionally additional coding sequence) and non-coding 
sequence, such as introns or noncoding sequence 5' and/or 3' of the coding sequence 
25 for the mature enzyme. 

Thus, the term "polynucleotide encoding an enzyme (protein)" 
encompasses a polynucleotide which includes only coding sequence for the enzyme 
as well as a polynucleotide which includes additional coding and/or non-coding 
sequence. 
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The present invention further relates to variants of the hereinabove 
described polynucleotides which encode for fragments, analogs and derivatives of the 
enzymes having the deduced amino acid sequences of Figures 1-2 (SEQ ID NOS: 7 & 
9). The variant of the polynucleotide may be a naturally occurring allelic variant of 
5 the polynucleotide or a nonnaturally occurring variant of the polyoucleotide. 

Thus, the present invention includes polynucleotides encoding the same 
mature enzymes as shown in Figures 1-2 (SEQ ID NOS: 7 & 9) as well as variants of 
such polynucleotides which variants encode for a fragment, derivative or analog of 
the enzymes of Figures 1-2 (SEQ ID NOS: 7 & 9). Such nucleotide variants include 
1 0 deletion variants, substitution variants and addition or insertion variants. 

As hereinabove indicated, the polynucleotides may have a coding 
sequence which is a naturally occurring allelic variant of the coding sequences shown 
in Figures 1-2 (SEQ ID NOS: 6 & 8). As known in the art, an allelic variant is an. 
alternate form of a polynucleotide sequence which may have a substitution, deletion 
1 5 or addition of one or more nucleotides, which does not substantially alter the function 
of the encoded enzyme. Also, using directed and other evolution strategies, one may 
make very minor changes in DNA sequence which can result in major changes in 
function. 

Fragments of the full length gene of the present invention may be used as 
20 hybridization probes for a cDNA or a genomic libraiy to isolate the full length DNA 
and to isolate other DNAs which have a high sequence similarity to the gene or 
similar biological activity. Probes of this type preferably have at least 10, preferably 
at least 1 5. and even more preferably at least 30 bases and may contain, for example, 
at least 50 or more bases. In fact, probes of this type having at least up to 1 50 bases or 
25 greater may be preferably utilized. The probe may also be used to identify a DNA 
clone corresponding to a fiill length transcript and a genomic clone or clones that 
contain the complete gene including regulatory and promoter regions, exons and 
introns. An example of a screen comprises isolating the coding region of the gene by 
using the known DNA sequence to synthesize an oligonucleotide probe. Labeled 
30 oligonucleotides having a sequence complementary or identical to that of the gene or 
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portion of the gene sequences of the present invention are used to screen a library of 
genomic DNA to determine which members of the library the probe hybridizes to. 

It is also appreciated that such probes can be and are preferably labeled 
with an analytically detectable reagent to facilitate identification of the probe. Useful 
5 reagents include but are not limited to radioactivity, fluorescent dyes or enzymes 
capable of catalyzing the formation of a detectable product. The probes are thus 
useful to isolate complementary copies of DNA from other sources or to screen such 
sources for related sequences. 

The present invention further relates to polynucleotides which hybridize to 
1 0 the hereinabove-described sequences if there is at least 70%, preferably at least 90%, 
and more preferably at least 95% identity between the sequences. (As indicated 
above, 70% identity would include within such definition a 70 bps fragment taken 
from a 100 bp polynucleotide, for example.) The present invention particularly relates 
to polynucleotides which hybridize under stringent conditions to the hereinabove- 
1 5 described polynucleotides. As herein used, the term "stringent conditions" means 
hybridization will occur only if there is at least 95 % and preferably at least 97% 
identity between the sequences. The polyoucleotides which hybridize to the 
hereinabove described polynucleotides in a prefen^d embodiment encode enzymes 
which either retain substantially the same biological function or activity as the mature 
20 enzyme encoded by the DNA of Figures 1 -2 (SEQ ID NOS : 6 & 8). In referring to 
identity in the case of hybridization, as known in the art, such identity refers to the 
complementarily of two polynucleotide segments. 

Alternatively, the polynucleotide may have at least 15 bases, preferably at 
least 30 bases, and more preferably at least 50 bases which hybridize to any part of a 
25 polynucleotide of the present invention and which has an identity thereto, as 

hereinabove described, and which may or may not retain activity. For example, such 
polynucleotides may be employed as probes for the polynucleotides of SEQ ID NOS: 
6 & 8, for example, for recovery of the polyoucleotide or as a diagnostic probe or as a 
PCR primer. 
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Thus, the present invention is directed to polynucleotides having at least a 
70% identity, preferably at least 90% identity and more preferably at least a 95% 
identity to a polynucleotide which encodes the enzymes of SEQ ID NOS: 7 & 9 as 
well as fiagments thereof, which fragments have at least 1 5 bases, preferably at least 
5 30 bases, more preferably at least 50 bases and most preferably fragments having up 
to at least 1 50 bases or greater, which fi:agments are at least 90% identical, preferably 
at least 95% identical and most preferably at least 97% identical to any portion of a 
polynucleotide of the present invention. 

The present invention further relates to enzymes which have the deduced 
1 0 amino acid sequences of Figures I -9 (SEQ ID NOS: 28-36) as well as fragments, 
analogs and derivatives of such enzyme. 

The terms "fragment,n nderivative" and "analog" when referring to the 
enzymes of Figures 1 -9 (SEQ ID NOS. 28-36) means enzymes which retain 
essentially the same biological function or activity as such enzymes. Thus, an analog 
1 5 includes a proprotein which can be activated by cleavage of the proprotein portion to 
produce an active mature enzyme. 

The enzymes of the present invention may be a recombinant enzyme; a 
natural enzyme or a synthetic enzyme, preferably a recombinant enzyme. 

The fragment, derivative or analog of the enzymes of Figures 1-2 (SEQ ID 
20 NOS : 7 & 9) may be (i) one in which one or more of the amino acid residues are 
substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may 
not be one encoded by the genetic code, or (ii) one in which one or more of the amino 
acid residues includes a substituent group, or (iii) one in which the mature enzyme is 
25 fused with another compound, such as a compound to increase the half-life of the 
enzyme (for example, polyethylene glycol), or (iv) one in which the additional amino 
acids are fiised to the mature enzyme, such as a leader or secretory sequence or a 
sequence which is employed for purification of the mature enzyme or a proprotein 
sequence. Such fragments, derivatives and analogs are deemed to be within the scope 
30 of those skilled in the art from the teachings herein. 
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The enzymes and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

The present invention also relates to vectors which include 
polynucleotides of the present invention, host cells which are genetically engineered 
5 with vectors of the invention and the production of enzymes of the invention by 
recombinant techniques. 

Host cells are genetically engineered (transduced or transformed or 
transfected) v^dth the vectors of this invention which may be, for example, a cloning 
vector such as an expression vector. The vector may be, for example, in the form of a 
1 0 plasmid, a phage, etc. The engineered host cells can be cultured in conventional 
nutrient media modified as appropriate for activating promoters, selecting 
transformants or amplifying the genes of the present invention. The culture 
conditions, such as temperature, pH and the like, are those previously used with the 
host cell selected for expression, and will be apparent to the ordinarily skilled artisan. 
* 5 The polynucleotides of the present invention may be employed for 

producing enzymes by recombinant techniques. Thus, for example, the polynucleotide 
may be included in any one of a variety of expression vectors for expressing an 
enzyme. Such vectors include chromosomal, nonchromosomal and synthetic DNA 
sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; 
20 yeast plasmids; vectors derived fitjm combinations of plasmids and phage DNA, viral 
DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, any 
other vector may be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a 
variety of procedures. In general, the DNA sequence is inserted into an appropriate 
25 restriction endonuclease site(s) by procedures known in the art. Such procedures and 
others are deemed to be within the scope of those skilled in the art. 

The DNA sequence in the expression vector is opcratively linked to an 
appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. As 
representative examples of such promoters, there may be mentioned: LTR or SV40 
30 promoter, the E. colL lac or trp, the phage lambda promoter and other promoters 
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known to control expression of genes in prokaryotic or eukaryotic cells or their 
viruses. The expression vector also contains a ribosome binding site for translation 
initiation and a transcription terminator. The vector may also include appropriate 
sequences for amplifying expression, 
5 In addition, the expression vectors preferably contain one or more 

selectable marker genes to provide a phenotypic trait for selection of transformed host 
cells such as dihydrofolate reductase or neomycin resistance for eukaryotic cell 
culture, or such as tetracycline or ampicillin resistance in E. coli. 

The vector containing the appropriate DN A sequence as hereinabove 

1 0 described, as well as an appropriate promoter or control sequence, may be employed 
to transform an appropriate host to ptnaix the host to express the protein. 

As representative examples of appropriate hosts, there may be mentioned: 
bacterial cells, such as £. coli, Streptomyces, Bacillus subtilis; fungal cells, such as 
yeast; insect cells such as Drosophila S2 and Spodoptera Sf9; animal cells such as 

1 5 CHO, COS or Bowes melanoma; adenoviruses; plant cells, etc. The selection of an 
appropriate host is deemed to be within the scope of those skilled in the art Irom the 
teachings herein. 

More particularly, the present invention also includes recombinant 
constructs comprising one or more of the sequences as broadly described above. The 

20 constructs comprise a vector, such as a plasmid or viral vector, into which a sequence 
of the invention has been inserted, in a forward or reverse orientation. In a preferred 
aspect of this embodiment, the construct further comprises regulatory sequences, 
including, for example, a promoter, operably linked to the sequence. Large numbers 
of suitable vectors and promoters are known to those of skill in the art, and are 

25 commercially available. The following vectors are provided by way of example; 
Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pBluescript H KS(Stratagene), ptrc99a, 
pKK223-3, pDR540, pRIT2T (Pharmacia); Eukaryotic: pXTl, pSG5 (Stratagene) 
pSVK3, pBPV, pMSG, pSVL SV40 (Pharmacia). However, any other plasmid or 
vector may be used as long as they are replicable and viable in the host. 

30 Promoter regions can be selected from any desired gene using CAT 
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(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters 
include lad. lacZ, T3, T7, apt, lambda PR, PL and tip. Eukaryotic promoters include 
CMV immediate early, HSV thymidine kinase, early and late S V40, LTRs from 
5 retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and 
promoter is well within the level of ordinary skill in the art. 

In a further embodiment, the present invention relates to host cells 
containing the above-described constructs. The host cell can be a higher eukaryotic 
cell, such as a mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the 
1 0 host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the 

construct into the host cell can be effected by calcium phosphate transfection, DEAE- 
Dextran mediated transfection, or electroporation (Davis, L., Dibner, M., Battey, I., 
Basic Methods in Molecular Biology, ( 1 986). 

The constructs in host cells can be used in a conventional manner to 
1 5 produce the gene product encoded by the recombinant sequence. Alternatively, the 
enzymes of the invention can be synthetically produced by conventional peptide 
synthesizers. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or 
other cells under the control of appropriate promoters. Cell-free translation systems 

20 can also be employed to produce such proteins using RNAs derived from the DNA 
constructs of the present invention. Appropriate cloning and expression vectore for 
use with prokaryotic and eukaryotic hosts are described by Sambrook et aL, 
Molecular Cloning: A Laboratory Manual Second Edition, Cold Spring Harbor, 
N.Y., (1989), the disclosure of which is hereby incorporated by reference. 

25 Transcription of the DNA encoding the enzymes of the present invention 

by higher eukaryotes is increased by inserting an enhancer sequence into the vector. 
Enhancers are cisacting elements of DNA, usually about from 1 0 to 300 bp that act on 
a promoter to increase its transcripUon. Examples include the SV40 enhancer on the 
late side of the replication origin bp 100 to 270, a cytomegalovirus early promoter 

30 enhancer, the polyoma enhancer on the late side of the replication origin, and 
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adenovirus enhancers. 
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Generally, recombinant expression vectors will include origins of 
replication and selectable markers peraiitting transformation of the host cell, e.g., the 
ampicillin resistance gene of K coli and S. cerevisiae TRPl gene, and a promoter 
derived from a highlyexpressed gene to direct transcription of a downstream 
5 structiu-al sequence. Such promoters can be derived from operons encoding glycolytic 
enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or 
heat shock proteins, among others. The heterologous structural sequence is assembled 
in appropriate phase with translation initiation and teimination sequences, and 
preferably, a leader sequence capable of directing secretion of translated enzyme. 
1 0 Optionally, the heterologous sequence can encode a fiision enzyme including an N- 
terminal identification peptide imparting desired characteristics, e.g., stabilization or 
simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable translation 
1 5 initiation and teimination signals in operable reading phase with a fimctional 

promoter. The vector will comprise one or more phenotypic selectable markers and an 
origin of replication to ensure maintenance of the vector and to, if desirable, provide 
amplification within the host. Suitable prokaryotic hosts for transformation include E. 
coli. Bacillus subtilis, Salmonella typhimurium and various species within the genera 
20 Pseudomonas, Streptomyces, and Staphylococcus, although others may also be 
employed as a matter of choice. 

As a representative but nonlimiting example, useful expression vectors for 
bacterial use can comprise a selectable marker and bacterial origin of replication 
derived from commercially available plasmids comprising genetic elements of the 
25 well known cloning vector pBR322 (ATCC 3701 7). Such commercial vectors 

include, for example, pKK223-3 (Pharmacia Fine Chemicals. Uppsala. Sweden) and 
pGEMI (Piomega Biotec. Madison, Wl, USA). These pBR322 "backbone" sections 
are combined with an appropriate promoter and the structural sequence to be 
expressed. 

30 Following transformation of a suitable host strain and growth of the host 
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strain to an appropriate cell density, the selected promoter is induced by appropriate 
means (e.g., temperatiire shift or chemical induction) and cells are cultured for an 
additional period. 

Cells are typically harvested by centrifligation, disrupted by physical or 
5 chemical means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by 
any convenient method, including freeze-thaw cycling, sonication, mechanical 
disruption, or use of cell lysing agents, such methods are well known to those skilled 
in the art. 

10 Various manunalian cell culture systems can also be employed to express 

recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts, described by Gluzman, Cell. 23: 175 (1981), and 
other cell lines capable of expressing a compatible vector, for example, the CI 27, 
3T3, CHO, HeLa and BHK cell lines. Mammalian expression vectors will comprise 

1 5 an origin of replication, a suitable promoter and enhancer, and also any necessary 
ribosome binding sites, polyadenylation site, splice donor and acceptor sites, 
transcriptional termination sequences, and 5' flanking nontranscribed sequences. DNA 
sequences derived from the SV40 splice, and polyadenylation sites may be used to 
provide the required nontranscribed genetic elements. 

20 The enzyme can be recovered and purified from recombinant cell cultures 

by methods including ammonium sulfate or edianol precipitation, acid extraction, 
anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Protein refolding steps can be used, as 

25 necessary, in completing confi-uration of the mature protein. Finally, high 

performance liquid chromatography (HPLC) can be employed for final purification 
steps. 

The enzymes of the present invention may be a naturally purified product, 
or a product of chemical synthetic procedures, or produced by recombinant 
30 techniques from a prokaryotic or eukaryotic host (for example, by bacterial, yeast. 
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higher plant, insect and mammalian cells in culture). Depending upon the host 
employed in a recombinant production procedure, the enzymes of the present 
invention may be glycosylated or may be non-glycosylated. Enzymes of the invention 
may or may not also include an initial methionine amino acid residue. 
5 Antibodies generated against the enzymes corresponding to a sequence of 

the present invention can be obtained by direct injection of the enzymes into an 
animal or by administering the enzymes to an animal, preferably a nonhuman. The 
antibody so obtained will then bind the enzymes itself In this manner, even a 
sequence encoding only a fragment of the enzymes can be used to generate antibodies 
1 0 binding the whole native enzymes. Such antibodies can then be used to isolate the 
enzyme from cells expressing tliat enzyme. 

The term "antibody," as used herein, refers to intact immunoglobulin 
molecules, as well as fragments of immunoglobulin molecules, such as Fab, Fab', 
(Fab')2, Fv, and SCA fragments, that are capable of binding to an epitope of an 
15 endoglucanase polypeptide. These antibody fragments, which retain some ability to 
selectively bind to the antigen {e,g,, an endoglucanase antigen) of the antibody fr^om 
which they are derived, can be made using well known methods in the art (see, e.g., 
Harlow and Lane, supra), and are described fiirther, as follows. 

(1) A Fab fragment consists of a monovalent antigen-binding fragment of an 

20 antibody molecule, and can be produced by digestion of a whole antibody molecule 
with the enzyme papain, to yield a fragment consisting of an intact light chain and a 
portion of a heavy chain. 

(2) A Fab* firagment of an antibody molecule can be obtained by treating a whole 
antibody molecule with pepsin, followed by reduction, to yield a molecule consisting 

25 of an intact light chain and a portion of a heavy chain. Two Fab' fragments are 
obtained per antibody molecule treated in this manner. 

(3) A (Fab')2 fragment of an antibody can be obtained by treating a whole antibody 
molecule with the enzyme pepsin, without subsequent reduction. A (Fdb^ fragment 
is a dimer of two Fab' fragments, held together by two disulfide bonds. 
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(4) An Fv fragment is defined as a genetically engineered fragment containing the 
variable region of a light chain and the variable region of a heavy chain expressed as 
two chains. 

(5) A single chain antibody ("SCA") is a genetically engineered single chain molecule 
5 containing the variable region of a light chain and the variable region of a heavy 

chain, linked by a suitable, flexible polypeptide linker. 

As used in this invention, the term "epitope" refers to an antigenic 
determinant on an antigen, such as an endoglucanase polypeptide, to which the 
paratope of an antibody, such as an endoglucanase-specific antibody, binds. 

1 0 Antigenic determinants usually consist of chemically active surface groupings of 
molecules, such as amino acids or sugar side chains, and can have specific three- 
dimensional structural characteristics, as well as specific charge characteristics. 

For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include 

1 5 the hybridoma technique (Kohler and Milstein, Nature, 256:495-497, 1 975), the 
trioma technique, the human B-cell hybridoma technique (Kozbor et al.. Immunology 
Today 4:72, 1983), and the EBV-hybridoma technique to produce human monoclonal 
antibodies (Cole et al., in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 
hic, pp. 77-96. 1985). 

20 Techniques described for the production of single chain antibodies (U.S. 

Patent 4,946,778) can be adapted to produce single chain antibodies to immunogenic 
enzyme products of this invention. Also, transgenic mice may be used to express 
humanized antibodies to immunogenic enzyme products of this invention. 

Antibodies generated against an enzyme of the present invention may be 

25 used in screening for similar enzymes from other organisms and samples. Such 
screening techniques are known in the art, for example, one such screening assay is 
described in Sambrook and Maniatis, Molecular Cloning: A Laboratory Manual (2d 
Ed.), vol. 2:Section 8.49, Cold Spring Harbor Laboratory, 1989, which is hereby 
incorporated by reference in its entirety. 
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The present invention will be further described with reference to the 
following examples; however, it is to be understood that the present invention is not 
limited to such examples. All parts or amounts, unless otherwise specified, are by 
weight. 

5 Example 1 

Production of the Expression Gene Bank 

An £, coli catalase negative host strain CAT500 was infected with a phage 
solution containing sheared pieces of DNA from Alcaligenes (Deleya) aquamarinus 
in pBluescript plasmid and plated on agar containing LB with ampicillin (100 

1 0 -g/mL), methicillin (80 --g/mL) and kanamycin (1 00 --g/mL) according to the method 
of Hay and Short (Hay, B. and Short, J., 1 Strategies, 5: 1 6, 1 992). The resulting 
colonies were picked with sterile toothpicks and used to singly inoculate each of the 
wells of 96-well microtiter plates. The wells contained 250 ,uL of SOB media with 
100 -g/mL ampicillin, 80 ~g/mL methicillin, and (SOB Amp/Meth/Kan). The cells 

1 5 were grown overnight at vdthout shaking. This constituted generation of the 
"SourceGeneBankn; each well of the Source GeneBank thus contained a stock culture 
of coli cells, each of which contained a pBluescript plasmid with a unique DNA 
insert. Same protocol was adapted for screening catalase from Microscilla furvescens. 

Example 2 

20 Screening f or Catalase Activity 

The plates of the Source GeneBank were used to multiply inoculate a single plate (the 
"Condensed Plate") containing in each well 200 of SOB Amp/Meth/Kan. This 
step was performed using the High Density Replicating Tool (HDRT) of the Beckman 
Biomek with a I % bleach, water, isopropanol, air-dry sterilization cycle in between 
25 each inoculation. Each well of the Condensed Plate thus contained 4 different 
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pBluescript clones from each of the source libraiy plates. Nine such condensed plates 
were prepared and grown for 1 6h at 37**C. 

One hundred (100) ^iL of the overnight culture was transferred to the white 
poiyfiltronic assay plates containing 100 pL Hepes/well. A 0.03% solution of 
5 hydrogen peroxide was made in 5 % Triton and 20 \iL of this solution was added to 
each well. The plates were incubated at room temperature for one hour. After an hour, 
50 of 120 mM 3-(p-hydroxyphenyl)-propionic acid and 1 unit of horseradish 
peroxidase were added to each well and the plates were incubated at room 
temperature for 1 hour. To quench the reaction. 50 ,jiL of 1 M Tris-base was added to 
1 0 each well . The wells were excited on a fluorometer at 320 nm and read at 404 nm. A 
low value signified a positive catalase hit. 

Example 3 

Isolation a nd Purification of the Active Clone 

hi order to isolate the individual clone which carried the activity, the 
1 5 Source GeneBank plates were thawed and the individual wells used to singly 
inoculate a new plate containing SOB Amp/Meth/Kan. As above the plate was 
incubated at 37**C to grow the cells, and assayed for activity as described above. Once 
the active well from the source plate was identified, the cells (mm the source plate 
were streaked on agar with LB/Amp/Meth/Kan and grown overnight at 37°C to obtain 
20 single colonies. Eight single colonies were picked with a sterile toothpick and used to 
singly inoculate the wells of a 96well microtiter plate. The wells contained 250 pL of 
SOB Amp/Meth/Kan. The cells were grown overnight at 37°C without shaking. A 
100 \iL aliquot was removed from each well and assayed as indicated above. The 
most active clone was identified and the remaining 150 ^iL of culture was used to 
25 streak an agar plate vnth LB/Amp/Meth/Kan. Eight single colonies were picked, 
grown and assayed as above. Tbe most active clone was used to inoculate 3mL 
cultures of LB/Amp/Meth/Kan, which were grown overnight. The plasmid DNA was 
isolated from the cultures and utilized for sequencing. 
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Example 4 
Expression of Catalascs 

DNA encoding the enzymes of the present invention, SEQ ID NOS: 7 and 
9, were initially amplified from a pBluescript vector containing the DNA by the PGR 
5 technique using the primers noted herein. The amplified sequences were then inserted 
into the respective pQE vector listed beneath the primer sequences, and the enzyme 
was expressed according to the protocols set forth herein. The 5' and 3' 
oligonucleotide primer sequences used for subcloning and vectors for the respective 
genes are as follows: 

1 0 Alcaligenes (Deleya) aquamarinus catalse: (pQET vector) 
5' Primer 

CCGAGAATTCATTAAAGAGGAGAAATTAACTATGAATAACGCATCCGCTG 
AC EcoRI (SEQ ID NO: 1) 

3 • Primer CGGAAAGCnTTACGACGCGACGTCGAAACG Hindi 1 1 (SEQ ID 
15 N0:2) 

Microscilla furvescens catalase: (pQET vector) 
5' Primer 

CCGAGAATTCATTAAAGAGGAGAAATTAACTATGGAAAATCACAAACACT 
CA EcoRI (SEQ IDN0:3) 

20 3' Primer CGAAGGTACCTTATTTCAGATCAAACCGGTC Kpnl (SEQ ID N0:4) 
The restriction enzyme sites indicated correspond to the restriction enzyme 
sites on the bacterial expression vector indicated for the respective gene (Qiagen, Inc. 
Chatsworth, CA). The pQET vector encodes antibiotic resistance (Ampr), a bacterial 
origin of replication (on), an IPTG-regulatable promoter operator (P/0), a ribosome 
25 binding site (RBS), a 6-His tag and restriction enzyme sites. 

The pQET vector was digested with the restriction enzymes indicated. The 
amplified sequences were ligated into the respective pQET vector and inserted in 
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frame with the sequence encoding for the RBS. The native stop codon was 
incorporated so the genes were not fused to the His tag of the vector. The ligation 
mixture was then used to transfonn the E. cold strain UM255tpREP4 (Qiagen, Inc.) 
by electroporation. UM255/pREP4 contains multiple copies of the plasmid pREP4, 
5 which expresses the lacl repressor and also confers kanamycin resistance (Kanr). 
Transformants were identified by their ability to grow on LB plates and 
ampicillin>^anamycin resistant colonies were selected. Plasmid DNA was isolated and 
confirmed by restriction analysis. Clones containing the desired constructs were 
grown overnight (0/N) in liquid culture in LB media supplemented with both Amp 
10 (100 u ^/ml) and Kan (25 u ^yml). The 0/N culture was used to inoculate a large 
culture at a ratio of 1 : 1 00 to 1 :250. The cells were grown to an optical density 600 
(O.D.*") of between 0.4 and 0.6. IPTG ("Isopropyl-B-D-thiogalacto pyranosiden") 
was then added to a final concentration of 1 mM. IPTG induces by inactivating the 
lacl repressor, clearing the P/0 leading to increased gene expression. Cells were 
1 5 grown an extra 3 to 4 hours. Cells were then harvested by centrifligation. The primer 
sequences set out above may also be employed to isolate the tm-get gene from the 
deposited material by hybridization techniques described above. 
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What Is Claimed Is: 



1 . Substantially pure catalase having an amino acid sequence of SEQ ID 
NO:7orSEQIDNO:9 

2. An isolated polynucleotide sequence encoding a catalase of claim 1 . 

3. An isolated polynucleotide selected from the group consisting of: 



a) SEQ ID:6 or SEQ ID N0:8; 

b) SEQ ID:6 or SEQ ID N0:8, wherein T can also be U; 

c) nucleic acid sequences complementary to a) and b); and 

d) fragments of a), b), or c) that are at least 1 5 bases in length and that 
will selectively hybridize to DNA which encodes the amino acid 
sequences of SEQ ID Nos:7 or 9, respectively. 



4. The polynucleotide of claim 2, wherein the polynucleotide is isolated 
from a prokaryote. 

5. An expression vector including the polynucleotide of claim 2. 

6. The vector of claim 5, wherein the vector is a plasmid. 

7. The vector of claim 5, wherein the vector is a virus-derived. 

8. A host cell transformed with the vector of claim 5. 

9. The host cell of claim 8, wherein the cell is prokaryotic. 

10. Antibodies that bind to the polypeptide of claim 1 . 
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1 1 . The antibodies of claim 1 0, wherein the antibodies are polyclonal. 

12. The antibodies of claim 10, wherein the antibodies are monoclonal. 

13. An enzyme comprising a member selected from the group consisting of: 

a) an enzyme comprising an amino acid sequence which is at least 
70% identical to the amino acid sequence set forth in SEQ ID 
N0:7 or SEQ ID N0:9; and 

b) an enzyme which comprises at least 30 amino acid residues to an 
enzyme of a). 

1 4. A method for producing an enzyme comprising growing a host cell of 
claim 8 under conditions which allow the expression of the nucleic acid 
and isolating the enzyme encoded by the nucleic acid. 

15. A process for producing a cell comprising: transforming or transfecting the 
cell with the vector of Claim 5 such that the cell expresses the polypeptide 
encoded by the DNA contained in the vector. 

16. A method for catalyzing an oxidation reaction comprising contacting a 
substrate with an effective amount of an enyzme selected from the group 
consisting of an amino acid sequence set forth in SEQ ID NOS: 7 or 9, 
thereby catalyzing an oxidation reaction. 

17. A method for detection or destruction of hydrogen peroxide in a sample 
comprising contacting the sample with an effective amount of an enzyme 
having an amino acid sequence set forth in SEQ ED N0:7 or SEQ ID 
N0:9, and detecting 

the presence of hydrogen peroxide in the sample. 
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FIGURE 1 

Alcaligenes (Deleya) aquamarinus Catalaao - 64ca2 

1 ATO AAT AAC OCA TCX OCT OAC OAT CTA CAC At3T AOC TTO CAO CAA AOA TOC AOA OCA TTT (0 

1 Mec A«n Aan Al« S«r Al« Asp kmp Leu Hi« Sor S«t L«u Oln Oln Arg Cy* Arg Kim Ph« 20 

(1 err ccc rro ota tco cca aoo okt aoa oca ata ago oao aqa ocr ato aoc ogt aaa tct iio 

21 V«l Pro V*l aar Pro Arg Hi« Arg Ala lie Arg Olu Arg Al« Met Ser Cly Lyo Cya 40 

121 CCT GTQ AXa CAC OCT OCT AAC ACC TCG ACC OCT ACT TCC AAC AAA CAT TOO TOO CCC CAA 1«0 

41 Pro Vml Mac Hio Oly Oly Aan Thr fler Thr Oly Thr 3«r Aan tya Aap Trp Trp Pro Olu to 

lai cca TTO AAC CrO oat ATT TTQ cat CAO CAA oat COC aaa TCA OAC ccc ato oat coo cat 240 

fil Cly Lau Aan Lau Aap Ila Lau Him Cln Oln Aap Arg Lya Sar Aap Pro N*e Aap Pro Aap 10 

241 TTC AAC TAC CCT OAA OAA OTA CCC AAO CTC OAT TTC SAC CCC CTO AAC AAA GAT CTC CAC 300 

tl Pha Aan Tyr Arg Glu olu Val Arg Lya Leu Aap Phe Aap Ala Leu Ly» Lya Aap Val Hi a 100 

3 01 OCO TTO ATO ACC OAT ACC CAA CAO TCO TOO CCC OCT OAC TOO OOO CAC TAC OOC GOT TTO 3<0 

101 Ala Leu Met Thr Aap Sar Cln Olu Trp Trp Pro Ala Aap Trp Cly Kia Tyr Oly Cly Leu 120 

3<l ATO ATC CGT ATO OCT TOO CAC TCC OCT OOC ACC TAC COT ATT OCT GAT OOC COT OOO CCC 420 

121 Met tie Arg Met Ala Trp Hie Ser Ala Oly Thr Tyr Arg He Ala Aap Oly Arg Oly Oly 140 

421 CGT CCT ACC CCA ACC CAO CCC TIT CCA CCO CTC AAC TCC TOC CCO CAC AAC CTC ACC CTO 480 

141 Oly Oly Thr Oly Sar Cln Arg Phe Aia Pro Leu Aan Ser Trp Pro Aap Aan Val Sar Leu ISO 

4fll CAT AAA CCG CCC OCT CTO CTO TOO CCC ATC AAO AAC AAO TAC OGC AAC AAA ATC AOC TCO 540 

1*1 A«p Lya Ale Arg Arg Leu Leu Trp Pro He Lya Lya Lya Tyr Oly Aan Lye lie aer Trp l»0 

S41 CCA CAC CTO ATO ATT CTO OCT OOC ACC OTO CCT TAT OAC TCC ATC CCC TTA CCT OCT TAC COO 

!•! Ala Aep Leu Met He Leu Ale Oly Thr V*l Ala Tyr Olu Ser Hat Cly L*u Pro Ale Tyr 200 

COl CCC TTC TCr TIC CCC CCC CTC CAT ATT TOO CAA CCC CAA AAA OAT ATC TAC TCO CCT CAC CCO 

201 Cly Pha Ser Phe Oly Arg Val Aap He Trp Olu Pro Olu Lya Aap He Tyr Trp Oly A«p 220 

<S1 CAA AAA CAC TCG CTC OCA CCT TCT OAC OAA COC TAC CCC OAC CTO AAC AAO CCA CAO ACC 720 

221 Clu Lya Clu Trp Leu Ala Pro Ser Aap Olu Arg Tyx Cly Aap Val Aen Lya Pro Olu Thr 240 

721 ATO CAA AAC CCO CTO CCD OCT CTC CAA ATO OOT CTO ATC TAT CTO AAC CCC CAA OCT CTT 750 

241 Met Clu Aen Pro Leu Ala Ala Val Oln Hec Oly Leu Il« Tyx Val Aan Pro Olu Cly Vel 3C0 

761 AAC OCC CAC CCT OAT CCO CTO AOA ACC OCA CAO CAO OTA CTT OAA ACC TTC CCC CGT ATO 140 

a«l Aen Oly Hla 9TO Asp Pro Leu Arg Thr Ale Gin Oln Val Leu Ciu Thr Phe Ala Arg Mac 210 

• 41 OCC ATO AAC OAC OAA AAA ACC OCA OCC CTC ACA OCT OOC OOC CAC ACC OTC OCT AAT TCT SOO 

a>l Ala Mac Aen Aap Olu Lya Thr Ala Ala Leu Thr Ala Cly Oly Hla Thr Val Oly Aan Cya 300 

901 CAC COT AAT OCC AAT OCC TCT OOO TTA OCC CCT OAC CCA AAA OCC TCT CAC OTT CAA AAC )<0 

301 Hie Oly Aan Oly Aen Ala lar Ala Lau Ala Pro Aap Pro Lya Ala Ser Aap Val Clu Aan 320 

»<1 CAO OOC TTA OOT TOO OOC AAC CCC AAC ATO CAO OOC AAO OCA AOC AAC OCC OTO ACC TCO 1020 

321 Cln Oly Leu Cly Trp Oly Aan Pro Aan Hec Oln Oly Lya Ala Sar Aen Ala Vel Thr Ser 3 40 

1021 OOT ATC OAA GOT OCT TOO ACC ACC AAC CCC ACO AAA TTC OAT ATO OOC TAT TTC OAC CTC lOaO 

341 Oly He Olu Oly Ale Trp Thr Thr Aan Pro Thr Lye Phe Aap Met Oly Tyr Phe Aap Leu 3C0 
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lOBl CTC TTC OOC TAC AAT TOO OAA CTO AAA AAO AOT CCT OCC OOT OCC CAC CAT TOO QAA CCO 1140 
3fil L«u PFt* Oly T/r A«n Trp olu L«u Ly» Lym S«r Pro Al« Oly Aim Hi* Hia Trp Olu Pro 380 

1141 ATT OAC ATC AAA AAO OAA AAC AAO CCO OTT OAC OCC AOC QAC CCC TCT ATT COC CAC AAC 1200 

3 SI He Asp lie Lya Lya Olu Aan Lym Pro Val Aap Al* 5er Aap Pro Sar He Arg Hie Aon 400 

1201 COO ATC ATO ACC OAT OCO OAT ATO OCO ATA AAO OTA AAT CCO ACC TAT COC OCT ATC TGC 1260 

401 Pro lis Met Thr Aap Ala Aep HeC Ala He Lya v«l Aan Pro Thr Tyx Arg Ala Il« Cya 420 

12SI GAA AAA TTC ATO OCC OAT CCT OAO TAC TTC AAO AAA ACT TTC OCO AAO OCO TOO TTC AAO 1320 

421 Olu Lye Phe Hec Ala Aap Pro Olu Tyr Phe Lya Lya Thr Pha Ala Lya Aia Trp Phe Lya 440 

1321 CPS AGO CAC CGT CAC CTO OCC CCO AAA TCA CCT TAC ATC OOC CCO GAA CTO CCO OCA OAA 13 80 

441 Lau Thr Hia Arg Aap Lau Oly Pro Lya Sar Arg Tyr Ha Oly Pro Olu Val Pro Ala Qlu 4C0 

1391 CAC CTO ATT TOO CAA GAC CCO ATT COO OCA OCT AAC ACC OAC TAC TGC OAA OAA OTO CTC 1440 

4 61 Aap Leu Zla Trp Oln Aap Pro I la Pro Ala Oly Aan Thr Aep Tyr Cya Olu Olu V«l Val 480 

1441 AAC CAC AAA XTT CCA CAA ACT CGC CTO ACC ATT ACT CAC ATO CTC TCC ACC CCT TOO GAC ISOO 

481 Lya Oln Lya II* Ala Oln Sar Oly Lau Sar Ila Sar Olu Met Val Ser Thr Ala Trp Aep 500 

ISOl ACT CCC CCT ACT TAT CGC OCT TCC CAT ATO CCC OOC OCT OCT AAC CGT CCC OCC ATT COC 1560 

sai S«r Mm. Axg Thr Tyr Arg Oly s«r Aep Hat Arg Oly Oly Ala Aan Oly Ala Arg Xla Arg S30 

1S«1 TTO CCC CCA GAG AAC GAO TOO CAC COC AAC GAO CCO QAG OOC CTC CCO AAA CTC CTC AOC 1620 

521 Leu Ala Pro Cln Aan Olu Trp Oln Cly Aan Olu Pro Olu Axg Lau Ala Lye Val Lau Sar 540 

1621 ore TAC OAO CAO ATC TCT CCC OAC ACC OCC GOT ACC ATC OCO OAC CTO ATC CTT CTO CCC IfilO 

541 Val Tyr Olu Gin Zla Ser Ala Aep Thr Cly &1« Ser Zl« Ala Aep Val He Val Lau Ala SCO 

1681 OCT AOC CTA CGC ATC OAO AAA OCC GCO AAA OCA CCA OCT TAC OAT CTO COC OTT CCC TTC 1740 

SSI Cly Sar Val Oly Zla Clu Lya Ala Ala Lya Ala Ala Oly Tyr Aap Val Arg Val Pro Phe 5 80 

1741 era AAA COC COT OOC OAT OCO ACC OCC OAO ATO ACC OAC OCA OAC TCC TTC OCA CCO CTO 1800 

581 Leu Lye Cly Ar? Cly Aap Ala Thr Ala Clu Mat Thr Aep Ala Aap Sar Phe Ala Pro Lau COO 

1801 GAO CCO CTO CCC OAT CGC TTC CCC AAC TOO CAO AAO AAA OAO TAT CTO OTO AAO CCG CAA LttSO 

601 Olu Pro Leu Ala Aep Oly Phe Arg Aan Tip Oln Lya Lye Olu Tyr Val Val Lye Pro Olu «20 

1361 GAG ATO CTO CTC OAT CCT CCC CAC CTC ATO CGC TTA ACC CGC CCS CAA ATC ACC CTO CTC 1920 

631 Clu Hec Uu Lau Aap Arg Ala Oln Leu Mat Cly Leu Thr Oly Pro Clu Hat Thr Val Lau 640 

1»21 CTO QGC OCT ATO COC OTA CTO OOC ACC AAC XAT OOT OOC ACC AAA CAC OOC 0IA TTC ACC Xf«0 

641 Lau Oly Oly M«c Axg Val Leu Oly Thr A«n Tyr Oly Oly Thr Lye Xla oly Val Phe Thr C60 

1981 GAT TCT GAA OOC CAO TTO ACC AAC GAC TTT TTT OTO AAC CTO ACC GAT ATO OOO AAC AOC 3040 

«C1 A«p Cya Olu Oly Oln Lau Thr Aan Aap Phe Ph* Val Aan Lau Thr Aap HaC Oly Aan Sar iso 

2041 TCC AAO CCO CTA OOT ACC AAC OCC TAC GAA ATC COC OAC COC AAO ACC OOT OCC CTO AAC 2100 

481 Trp Lya Pro Val Oly Ser Aan Ala Tyr Olu II* Arg Aap Arg Lya Thz Oly Ala Val Ly* 700 

aiOl TOO ACC OCC TOO CCO CTO GAT CTO OTA TTT OCT TCC AAC TCO CTA CTO CCC TCT XAC CCA 31C0 

701 Trp Thr Ala Sar Arg V«I Aep Lau Val Phe Oly Sar Aan Ser Lau Lau Arg Sar Tyr Ala 730 

31C1 GAA OTO TAC OCC CAO OAC OAT AAC QGC OAO AAO TTC CTC AGA GAC TTC CTC OCC CCC TCC 3330 

721 Olu Val Tyr Ala Oln Aap Aap Aan Oly Olu Ly* Ph* Val Arg Aap Phe Val Ala Ala Trp 740 

2331 ACC AAA OTO ATO AAC OCC OAC CCT TTC OAC OTQ OCO TOO TAA 3283 
741 Thr Lye Val Mac Aan Ala Asp Arg Ph* Aap Val Ala Jar End 7S4 
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Mlcromellla. fujnrmacmna C*t»l*»« S3CA1 

1 ATO QW AAT OVC AAA ac TCK OO* TCr TCr ACO TAT AAC ACA AAC ACT OOC CWA AAA TOC 
r M« Clu A.n HI. Ly. Ki, s.r oly s.r s.r Thr Tyr A.n Thr A.n Thr oly Oly Ly. Cy. 

CCr rrr ACC OOA OCTT TCO err AAO CAA AOT OCA OOT OOC OOC ACC AAA AAC AOO OAT TCO 
21 pro Ph. Thr Oly oly aar Ly. Oln S.r Ala Oly Oly oly Thr Ly. A.n Arg A.p Trp 

TOG CCC AAC ATO CTC AAC CTC GCC ATC TtA CCC CAA CAT TCA TCO CTA TCO CAC CCA AAC 
.1 Trp Pro A.n H.t L.u *.n L.u Cly tl. L.u Ar, Oln Hi. S.r S.r L.u S.r A.p Pro A.n 
1.1 CAC ceo GAT TTT CAC TAT GCC OAA OAO rrr AAO AAO CTA OAT CrC OCA CCG OTI AAA A^ 
Z pro A.p Ph. A.P TV. Al. Olu Olu PK. Ly. Ly. L-u A.p L.. Al. Al. V.I Ly. Ly. 

„1 CAC CIO OCA OCC CTA ATO ACA OAT TCA CAO OAC TOO TOO CCA =^ !^ 

.1 A.P I-u Al* Al. L.U H.t Thr A.p S.r 01. A.p Trp Trp Pro Al- Aap Tyr Oly Hi. Tyr 

,01 OOC CCC XTC TIT ATA COC ATO OCO TOO CAC AOC OCC OOC ACC TAC COT ATC OOT GAT OGC 
101 Cly Pro Ph. Ph. II. AT, Al. Trp Hi. »« Al. Oly Thr Tyr Ar, II. Oly A.p Oly 
3« COT OOT OOC OOT OOC TCC «C TCA CAO CQC TTC OCO CCT CTC AAT AOC TOO CCA QAC AAT 
ll! Z «ly Oly OiV Oly S.r Oly s.r Ol„ Ar, Ph. Al. Pro L.u A.n S.r Trp Pro A.p A.n 
«1 OCC AAT CTO OAT AAA GCA COC TTO err CtT TOO CCC ATC **A C« AAA TAC 0« COA AAA «0 
Z Z Z A.P Ly. Al. ^ L.U L.U L.U Trp Pro XI. Ly. Oln Ly. Tyr Oly Ar, Ly. 

• 4.1 ATC TCC TOO OCO OAT CIA ATO ATA CTC AO. OOA A*C OTA OCT CTO GAA ACT ATO «C TTT 
Z iH in Trp Al. A.P L.U «.c XI. L.U Tnr Oly ^ V.l Al. L.U Olu Thr H.. Oly Ph. 
S4I AAA ACT TTT OCT TTT OCA OCT GCC AOA CCA OAT «A TOO «0 OCT OAA CAA «T OTA TAC 
^ ^ in Oly PH. Al. Oly Oly Ar, Al. A.p V.1 Trp Olu P,o Olu Olu A-p V.l Tyr 
.01 TOOOGAOO.CAAACC«:AATOCCTCOOAaACAAaCGCTAra«.CCrCACC=A«aCTCG^ 
!ol Z aiy Xi. ^ A.P Ly. AX, Tyr Olu oly A.p Are olu L.u olu 

„I AAT CCC CIO OOA OCC <«A CAA ATO OOA C« ATC TAT CIA AAC CCC OAA OOA CCC AAC OOC 
Z Z Oly AI. V.1 om ..t OW L-u H. xyr V.1 A-n Pro Olu Oly Pro A.„ Oly 
T21 AAO CO. GAC CCT ATC OCT OCT COG COT OAT ATT CCT OAO ACT rrr 0« CGA ATO OCA A^ 
I" Z A.P pro 11. Al. Al. Al. AT, A.P 11. AT, Olu Thr Ph. Oly Arg H.t Al. H.c 
7.1 AAT OAC 0»A OAA ACC OTO OCT etc ATA OCG OCTT OOA CAC ACC rrc OOA AAA ACC CAT GOT 
IW Z Zl Z Olu Thr V.1 Al. L.U 11. Al. Oly Oly Hi. THr Ph. Oly Ly. Thr Hi. Oly 
.41 OCT OCC OAT OCO GAG AAA TAT OTO OOC COA OIU; CCt OCC OCC OCA OCT ATT OAA OAA ATO 
Z Z "p Al. Olu Ly. Tyr V.X Oly Arg olu Pro Al. Al. Al. Oly U. Olu Olu ... 
,01 ACC CTO GOO TOO AAA AAC ACC TAC OOC ACC OOA CAC GOT oca OAT ACC ATC ACC ACrr OCA 

!ci Z Z Oly Trp A.n Thr Tyr oly Thr Oly Hi. Oly AU A.p Thr 11. Thr ..r Oly 

,.1 CTA OAA OGC OCC TOO ACC AAG ACC CCT ACT C*A TOO AGC AAT AAC TTT TTT QAA AAC CTC 
Z.T.Z Trp Tnr TH. pro TK, Gin Trp S.r A.n A.n Ph. Ph. Olu A.n L-u 
TTrOOTtACOAOTOOOAOCTTACCAAAAOTCCAOCrOaAOCrXATCAOTGOAAACaA^ 

Z III ^ Tyr Olu Txp GU L.U Thr -« ^ "^'^ 

10.1 OAC 0« OCC OOO OCT OOC ACC ATA CCG OAT OCA CAT OAT CCC AGC AAO TCO CAC OCT CCA XI40 

Z Z Z KX. Oly Al. Oly Thr II. Pro A.p AU Hi. A.p Pro S.r Ly. S.r Hi. Al. Pro 3.0 
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I Ml TTT ATC3 CTC ACT ACO GAC CTO OCO CTO COC ATO QAC CCT OAT TAC OAA AAA APT TCT CCA 1200 
3il 9hm H«C L«U Thr Thr Amp L.u Al* L«u Arg Mat Amp ?vo Amp Tyv Olu Ly». rl. 3,r ^^q 400 

1201 COO TAC TAT OAA AAC CCT OAT OAO TTT OCA GAT OCT TTC OCO AAA OCA TOO TAC AAA CTO ia«0 
401 Ara Tyr Tyr Olu A.n Pro Amp Olu Ph. Al« A.p Ai< Ph« AU by. Al« Trp Tyr Ly« L.u 420 

12<1 ACA CAC AGA GAT ATO GGA CCA AAO OTO COC TAC CTO OOA CCA OAA OTO CCT CAO OAA OAC uao 
4H Thr Him Arg Asp H«ti Oly Pro Ly» V«l Arg Tyr L-u Oly Pro Olu V.l Pro Oln Olu A«p 440 

1321 CTC ATC TOO CAA OAC CCT ATA CCA OAT OTA AGC CAT CCT CTT OTA OAC OAA AAC OAT ATT 1380 
44 1 Uu tl« Ttp Oln Aap Pro lie Pro A.p V*l Smz Him Pro L«u V«l Asp Olu Amn Asp Il« 4ca 

CAA GGC CTA AAA GCC AAA ATC CPO OAA TCO OOA CTO AGO OTA AGC OAO CTO CTA ACC ACC 1440 
4C1 Olu Oly L«u Ly» Aim Lym Il« Leu Olu Ssr oly Lau Thr V.l Smr Olu Lau V.l s«r Thr 480 

1441 CCA TOO OCT TCT OCA TCT ACT TTT AOA AAC TCT OAC AAO COC OOC 0(JT OCC AAC OCT OCA isoo 
411 Al* Trp Al* S.r Al* 5«r Thr Ph« Arg A.n S.r A.p Ly* Arg flly Cly Al. A.n Oly Al« SOO 

ISOl CCT ATA COA CTO OCC CCA CAA AAA OAC TOO OAA OtA AAC AAC CCT OUJ CAA CTT OCC AOd IStQ 
SOI Arg lU Arg L«u Ala Pro Oln Ly. A«p Trp Olu V«l Aan Aan Pro Oln Oln L.u Ala Arg $20 

IS«1 OTA CTC AAA ACA CTA CAA OCT ATC CAO GAO OAC TTT AAC CAfl OCO CAA TCA GAT AAC AAA 1S30 
S31 V«l L.U Ly* Thr L«u Olu Oly II. Gin Clu Asp Ph. Aan Oln Al* Gin S.r A.p A.n Ly. S40 

Ifiai GCA CTA TCG TTO CCC OAC CTO ATT CTO CTO CCC OOC TOT OCO OCT OtA OAA AAA CCT OCA ICOO 
541 Al. V«l S.r Lau Al. Aap Lau II. V.l Lau Al. Oly Cya Ala Oly Val Olu Lya Ala Al* SCO 

. ICai AAA OAT OCT OOC CAT OAO OTO CAO OTO CCT TTC AAC CCO OGA CGA OCO GAT CCC ACC OCT 1740 
SCI Lya Aap Ala Oly Hia Clu Val Oln Val Pro Pha A«n Pro Oly Arg Ala Aap Ala Thr Al* sao 

1741 CAC CAA ACC GAT GTO OAA CCT TTC CAA CCA Cia GAG CCA OCO OCT OAC CCC TTT AQA AAC ISQO 
sai clu Oln Thr A«p Val oLu Ala Pha Clu Ala Lau Olu Pro Ala Ala Aap Oly Pha Arg Aan coo 

liOl TAC AIT AAA CCO GAfl CAT AAA OtATCCOCTGAOOAAATOCTCOIAQACOOOOCOCAacrT ItCO 
SOX Tyr :i« Ly. Pro Olu Hi« Ly» Val Sm= Al* Olu Olu H.c L.u Val Ajp Arg Al* Oln Leu CIO 

lt<l CTO TCG CTT TOO GCA CCA CAA ATO ACT OCT TTO OTA OGC OOT ATO CST OTA CTO OCC ACC 1S20 
«21 Leu Ser Leu 9er Aia pro Olu Hoc Thr Ala Leu Val Oly Oly Nsc Arg Val Leu Oly Thr C40 

19ZI AAC TAC OAC OOT TC3 CAC CAT GGA CTO TTT ACA AAT AAA CCO OCT CAC CTA TCC AAT OAC 
C-ll Aan Tyr Aap Oly Sar Oln Hia Gly Val Pha Thr Aan Ly* ?ro Oly Oln Leu Sar A«n Aap CCO 

1911 TTC TTT OTA AAC CTO CTA OAC CTC AAC ACT AAA TOO OOA OCC AOC OAT OAA TCA CAC AAA 3040 
««1 Phe Pha Val Aen Leu Leu Aep Lau A«n Thr Lye Trp Arg Ala Bar Aep Olu «er Aap Ly* «io 

2C41 CTT TTT CAA CGC ACA 6AC TTC AAA ACT OOC CAA OTA AAO TOO ACT OOC ACC COO OTA GAC 2100 
cai v.l Phe olu Oly Arg A«p Ph* Ly* Thr Oly olu Val Lya Trp Jer Oly Thr Arg V*l Aap 700 

3X01 CTOATCTTCOGATCCAATTCCGAOCXAAaAOCCCrCO«aAAOTOTACaOCTOTOCAaAT aiCO 
701 Leu 11* Ph* Oly Ser A*n a*r Olu Leu Arg Ala Lau Ala Olu Val Tyr Oly Cy* Ala Aap 730 

31CI TCrCAAaAAAAOnTOTTAAAaATTrrOIOAAOOCCTOOOCCAAAOXAATOaACCtOGAC 2230 
731 Ser Olu Olu Ly. Phe V.l Lya Aap the Val Ly* Al* Trp Al* Ly* v.l Mee Aap Leu Aep 74 0 

2221 COO TTT OAT CTO AAA TAA 333I 
741 Arg Phe Aap Leu Lye Knd 74 C 
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